type belief
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Africa > Zimbabwe (0.04)
- North America > United States > California > Riverside County > Riverside (0.04)
- (2 more...)
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.67)
- Leisure & Entertainment > Games (1.00)
- Information Technology (0.92)
- Energy (0.67)
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Africa > Zimbabwe (0.04)
- North America > United States > California > Riverside County > Riverside (0.04)
- (2 more...)
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.67)
- Leisure & Entertainment > Games (1.00)
- Information Technology (0.92)
- Energy (0.67)
Safe Exploitative Play with Untrusted Type Beliefs
Li, Tongxin, Handina, Tinashe, Ren, Shaolei, Wierman, Adam
The combination of the Bayesian game and learning has a rich history, with the idea of controlling a single agent in a system composed of multiple agents with unknown behaviors given a set of types, each specifying a possible behavior for the other agents. The idea is to plan an agent's own actions with respect to those types which it believes are most likely to maximize the payoff. However, the type beliefs are often learned from past actions and likely to be incorrect. With this perspective in mind, we consider an agent in a game with type predictions of other components, and investigate the impact of incorrect beliefs to the agent's payoff. In particular, we formally define a tradeoff between risk and opportunity by comparing the payoff obtained against the optimal payoff, which is represented by a gap caused by trusting or distrusting the learned beliefs. Our main results characterize the tradeoff by establishing upper and lower bounds on the Pareto front for both normal-form and stochastic Bayesian games, with numerical results provided.
- Asia > Middle East > Jordan (0.04)
- Africa > Zimbabwe (0.04)
- North America > United States > California > Riverside County > Riverside (0.04)
- (2 more...)
Eliciting Honest Reputation Feedback in a Markov Setting
Witkowski, Jens (Albert-Ludwigs-Universität Freiburg)
Recently, online reputation mechanisms have been proposed that reward agents for honest feedback about products and services with fixed quality. Many real-world settings, however, are inherently dynamic. As an example, consider a web service that wishes to publish the expected download speed of a file mirrored on different server sites. In contrast to the models of Miller, Resnick and Zeckhauser and of Jurca and Faltings, the quality of the service (e. g., a server’s available bandwidth) changes over time and future agents are solely interested in the present quality levels. We show that hidden Markov models (HMM) provide natural generalizations of these static models and design a payment scheme that elicits honest reports from the agents after they have experienced the quality of the service.
- Europe > Germany > Baden-Württemberg > Freiburg (0.04)
- North America > United States > Pennsylvania (0.04)